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1 INTRODUCTION 



Photometric redshifts have been used since the 1960s jBauml 
1962) as a means of estimating redshifts in surveys where it 
would be impractical to obtain spectroscopic redshifts for 
all the objects observed, or where objects are too faint for 
spectroscopic techniques to be applied. Since then multi- 
colour surveys have commonly used some form of redshift 
estimation with varying degrees of success depending on the 
selection of filters used and the method of relating particular 
sets of colours to a redshift estimate. 

The importance of the technique is growing not 
only with the desire to gain a greater understanding of 
galaxy evolution (through, for example, the determina- 
tion of luminosity functions) but also in weak gravitational 
lensing, where redshift estima tes can reduce contamina- 
tion from intrinsic align ments jHevmans fc Heavens! I2OO2I 
iKing fc Schneiderl 120031), and al low the p ossibi lity of 3D 
lensing studies. iHul (Il999ll and iHeavensI (1200311 proposed 
3D lensing analyses that provide better constraints on the 
mass power spectrum and vacu um ener g y equ ation of state 
than traditional '2D' analyses. iTavlorJ (l2002f) discussed a 
method of using redshift information to reconstruct the 
3D lens ing potential, and thi s has been applied to simu- 
lations teacon fc Tavlorll2003l) and t he COMBO- 17 multi- 
waveband survey jTavlor et al]l2004D . 

In all these applications the aim is to estimate redshifts 
photometrically at magnitudes fainter than may readily be 
achieved by multi-object spectroscopy. In the case of weak 
lensing studies in particular, there is a strong motivation to 
measure the lensing signal at the faintest possible magni- 
tudes. But of course at faint magnitudes the photometric 
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measurement errors become significant and cause increased 
redshift errors. We shall show below that this leads to a 
bias in the estimated redshift distribution obtained which 
for 3D weak lensing and galaxy evolution studies could se- 
riously affect the results. In recognition of the difficulty of 
obtaining reliable photometric redshifts at faint magnitudes, 
it is common instead to assume a statistical distribution for 
redshifts which may be calculated given knowledge of the 
evolving galaxy luminosity function. 

In this paper we discuss a Bayesian approach to redshift 
estimation, examining specifically the case of a survey util- 
ising UBVRI imaging since large area weak lensing surveys 
will be expected to use such broad-band datasets. We show 
that, by adopting a prior calculated from galaxy luminos- 
ity functions, it is possible both to correct for the bias in 
the sample distribution and to obtain redshift distributions 
that smoothly converge to the prior distribution at the limit 
of faint magnitudes. We show however that there is a price 
that must be paid: for each galaxy it becomes necessary not 
to assign a single definite redshift, but rather to consider its 
entire posterior probability distribution in redshift, in order 
to avoid bias. 

The idea of applying a prior to i mprov e photometric 
redshift estimates is not new. iBemted (l2000l) has discussed 
adopting a Bayesian analysis in photometric redshift analy- 
sis and tested this using a parameterised form for the prior 
on a spectroscopic sample in the Hubble Deep Field North. 
Some improvement in redshift error was obtained, and by 
inspecting the values of the posterior probability distribu- 
tion, outliers in the distribution of sample redshifts could be 
removed. 

iBrodwin et alJ <l2003h used an iterative method to esti- 
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mate a prior that improved the estimate of the overall red- 
shift distribution, but this did not follow a strictly Bayesian 
approach and the method was shown to break down at very 
low signal-to-noise. 

In fact, consideration of Bayes' theorem leads us to ex- 
pect that it should be possible to calculate a prior which is 
indeed based on our prior knowledge, and that the applica- 
tion of this to the redshift estimation problem should not re- 
quire iteration or training. Furthermore, a full Bayesian ap- 
proach requires us to consider the complete posterior prob- 
ability distribution, and we discuss in the next section why 
this is necessary to avoid biased distributions at faint mag- 
nitudes. 



2 THE BAYESIAN METHOD 

2.1 Introduction to the Bayesian method 

There are two classes of approach to the problem of photo- 
metric redshift estimation. One is to fit colours by a range 
of templates (as employed by Hyp erz and C FDF-PRS - see 
iBolzonella. Miralles fc PelldlboOOl and iBrodwin etHI 120031 
respectively) and the second is to utilise some method of 
machin e learning such as the a rtificial neural networks of 
ANNz iCollister fc Lahavll2004D . support vector machines 
(a type of learning algo rithm for ge neral classification prob- 
lems) as used by Wadadckar ( 200 5), or polynomial fits to a 
spectr oscopic sample llConnollv. Csabai fc SzalavllT995l . and 



later iHsieh et alj f2005). Whilst the second approach as 
demonstrated by ANNz is highly effective for reasonably 
bright objects where it is practical to obtain a large training 
set of spectroscopically identified objects, we use methods 
based on the former approach as they are more readily used 
when such training sets are not available, as is the case for 
samples of faint objects. 

Suppose we have some measured colour information, de- 
noted by C, for a galaxy. Traditional template-based photo- 
z methods work with the likelihood C(C\z). A Bayesian 
approach is to consider instead the posterior probability 
p(z\C, V) <x j£(C\z)p(z\'P) where V denotes our prior knowl- 
edge and p(z\V) is the corresponding prior probability dis- 
tribution for 2. The source of prior information we wish 
to use here is the galaxy luminosity function, (p, deter- 
mined from spectroscopic redshift surveys. In order to use 
this information we need to determine the redshift distri- 
bution that results from a known luminosity function as 
a function of galaxy magnitude m and spectral type S: 
p(z\V) = p(z\m, S, <f)( m , S, z)). This prior distribution is 
a more general form of the redshift distribution that has 
previously been used i n the limiting cas e of assuming no 
colour information (e.g. lBrown et al.l2003l who assume a dis- 
tribution based on extrapolating the median redshift from 
magnitude ranges where photometric redshifts are available 
out to fainter magnitudes where they are not), and hence 
the Bayesian approach can be seen to fully encapsulate the 
transition between the high signal-to-noise regime, where 
C(C\z) is sharply defined and the prior has no effect, and 
the low signal-to-noise regime where the prior information 
dominates. 



2.2 Noise bias and choice of prior 

Before discussing the application of such a prior, we first 
need to highlight the bias that is introduced by using only a 
likelihood-based estimator in the presence of noise, and also 
to discuss the extent to which applying a Bayesian prior may 
alleviate that bias. 

Consider a sample of galaxies with a true redshift distri- 
bution n(z) and suppose those redshifts are measured by any 
technique which has some error distribution e(z \z) where z 
and z are the measured and true redshifts respectively. Then 
the observed sample distribution of redshifts is the convo- 
lution n(z) * e(z \z) which is different from, and typically 
broader than, the true redshift distribution. The difference 
between true and measured distributions is small at high 
signal-to-noise where e(z \z) tends to a delta function, but 
becomes significant when the width of the e(z \z) distribu- 
tion is comparable to the width of the true distribution. 

A Bayesian approach circumvents this problem. In prac- 
tice we measure a set of photometric data C, with proba- 
bility of obtaining that data e(C\z). We assume a likelihood 
function C(C\z), where £(C\z) = e(C\z) if the likelihood 
function is a good representation of the true process by 
which the observed galaxy data is created. We can calculate 
a normalised Bayesian posterior probability distribution 



p(z\C,V) = 



H*)£(.c\z) 

fdzT(z)£(C\. 



(1) 



But the photometric distribution of galaxies at this mea- 
sured redshift is the convolution of the true distribution with 
the error distribution as discussed above, and hence the sum 
of the normalised posterior probability distribution for the 
sample is 



dCn(C)p(z\C,V) = 



dC 



n(z) 



Hz)£(C\z) 
f dzV(z)jC(C\z 



dz'n(z')e(C\z') 



(2) 



if C{C\z) = e(C\z) and we choose V(z) — n(z). Hence the 
posterior probability distribution may be used to create sam- 
ple redshift distributions which are unbiased provided 

(i) the likelihood function is an accurate reflection of the 
true statistical process (this requirement also applies to 
likelihood-based methods of course) 

(ii) the prior is sufficiently well-known. 



2.3 Choice of estimator 

From the above, we can see that a likelihood-based esti- 
mator (either maximum likelihood or minimum variance) 
should give adequate results in the limit of high signal-to- 
noise where the effect of ignoring the prior is unimportant. 
We shall argue below that at faint magnitudes, however, the 
Bayesian posterior probability distribution should be used. 
This makes the process of utilising photometric redshifts at 
faint magnitudes more problematic, as we now no longer 
have a single estimated value, perhaps with some quoted 
uncertainty, but instead have a continuous probability dis- 
tribution. 

One approach would be to replace, say, a maximum 
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likelihood estimator by a maximum poste rior prob ability 
(MPP) estimator or similar, as advocated bv lBenfte3 (2000). 
Such an estimator, which would simply return maxp(z) 
rather than max£(z) would still be biased, however, albeit 
differently from the ML estimator. In the limit where the 
prior dominates over the likelihood function, the MPP esti- 
mator would place all galaxies at the same redshift, at the 
peak of the prior distribution. So we can see that the ML es- 
timator tends to produce a sample distribution which is too 
broad, an MPP estimator a sample distribution that is too 
narrow, with only the full posterior probability distribution 
encapsulating the information in an unbiased way. 

These considerations lead to the supposition that there 
might exist an estimator which would create an unbiased 
sample distribution, which would be obtained by applying a 
prior somewhere between the uniform case assumed in the 
ML estimator and the full Bayesian prior. In the case of a 
simple distribution, such as the normal distribution, it is 
straightforward to calculate such a prior. For more complex 
distributions it may only be evaluated numerically. We refer 
to this as a "sample prior" , but in light of the evalutation of 
the alternative technique presented in this paper we do not 
believe that this is a useful approach due to the difficulties 
involved in correctly calculating such a prior from Monte 
Carlo simulations. This technique is discussed in more detail 
in Section Qj] 



3 APPLICATION TO A TEST DATASET 
3.1 The COMBO-17 survey 

To test whether the ideas discussed above are applicable 
to real datasets, and to measure the difference in results 
that may be obtained, we test the method on p hotometric 
data from COMBO-17 survey J Wolf et aUl2004h and spec- 
trosc opic redshifts obtained from VVDS Fevre et alJ 
l2004f) . with the two datasets overlapping on the Chandra 
Deep Field South (CDFS). COMBO-17 has imaging in 17 
optical filters from the Wide Field Imager on the MPG/ESO 
2.2m telescope sited at the La Silla observatory in Chile, and 
reaches depths of R < 25.4 (10<r limit for the CDFS, Vega 
magnitude). The full 17-band COMBO-17 photometric red- 
shifts have accuracies of Sz/(1 + z) ~ 0.025 for R < 23 (ap- 
proximately 2000 objects) rising to 0.06 for R < 24 (10,000 
objects). VVDS is a spectroscopic survey of 11,564 objects 
over 0.6 square degrees using VIMOS on ESO-VLT UT3. 
The sample is selected purely by 17.5 ^ Iab ^ 24. 

We use here a sample of 671 galaxies, selected to have 
photometry from the CDFS field of COMBO-17 and spec- 
troscopic redshifts from VVDS. We select only VVDS galax- 
ies with redshifts assigned such that more than 95 per cent 
are correct (corresponding to VVDS redshift flags of 3, 4, 23 
or 24). Whilst lower- confidence VVDS redshifts would be 
acceptable in many other applications where redshift errors 
might average out, we choose to be highly conservative in 
order to ensure that our assessment of photometric redshift 
errors are not distorted by errors in our spectroscopic sam- 
ple. The distribution of these galaxies in R and z is shown 
in Fig. [J The median R of the sample is 22.6, with 25th and 
75th percentiles at R = 21.8 and R = 23.3 (Fig.0. 

Of the 671 VVDS galaxies 21 do not have colours close 



1 .5 r 



1.0 - 



0.5 - 



0.0 L 



16 



20 22 
R magnitude 



24 



26 



Figure 1. VVDS galaxy spectroscopic redshifts and magnitudes, 
limited to 2 < 1.5 (excluding 0.3 per cent of the sample) 
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Figure 2. VVDS magnitude distribution, i?-band aperture mag 
nitudes. 25th, 50th and 75th percentiles marked. 



to those of a template and produce problematic redshift es- 
timates as a result, and are therefore excluded. Two of the 
remaining objects have a true redshift beyond z = 1.5, and 
are also excluded from the set, bringing the total removed to 
3.4 per cent. Three further objects are at 1.4 < z < 1.5, and 
whilst strictly beyond the limit of the estimation templates 
are quite close and are still included. 

In addition, COMBO-17 provides a set of galaxy lumi- 
nosity functions for three broad galaxy types, derived from 
all three fields. These types are separated by rest-frame 
U — V colour-cuts evolving with redshift, with a cut be- 
tween blue and red populations and a further cut between 
two subsets of the blue population (Table |5J. An earlier 
variant of t he calculation of t hese luminosity functions is 
describ ed in IWolf et al. | ]2005t and a more recent calcula- 
tion in lFaber etHuTifeoOsll . which describes functions much 
like those outlined here but where the two Blue types are 
combined into a single set. The three types used here are 
separated by rest-frame U — V colour-cuts evolving with 
redshift, with a cut between blue and red populations and 
a further cut between two subsets of the blue population 
(Table This data is used in the construction of the prior 
itself (see Section 13. 2.3^ . 
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Table 1. Galaxy luminosity function parameters. M* is 1 
rest-frame Johnson B. 
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Table 2. Colour cuts between galaxy populations ex 
terms of (U - V) Icst + 0.08 X (M v + 20). 



3.2 Estimation technique and construction of the 
Bayesian prior 

3.2.1 Spectral models 

Galaxy spectral models are t aken from P EGAS E stel- 
lar p opulation synthesis models iFioc fc Rocca-Volmerangel 
Il997f) as detailed in the COMBO-17 CDFS reference pa- 
per IWolf et alJl2004h . By shifting the template wavelengths 
and passing through the appropriate filter models colour 
templates are obtained. In all 360 different spectral energy 
distribution templates (SEDs) are shifted over 177 intervals 
equidistant in log(l + z) running from z = to 2 = 1.4. 

The redshift range is restricted to z < 1.4 owing to a 
lack of high contrast spectral features in the bands used (a 
problem that could be alleviated by observing in the near 
infra-red) , exacerbated by noisy photometry in the typically 
faint z > 1.4 objects, both of which severely limit the ability 
to make accurate estimates. 

Spectr al models for st ars and quasars (based o n tem- 
plates from |Picklesll998l and lvanden Berk et all200ll respec- 
tively) are also used. A decision is made into which of these 
three classes an object should fall by normalising the total 



likelihoods for each by the number of templates available 
in each class, multiplying by an overall class prior based on 
/-band magnitudes and choosing the class with the highest 
normalised likelihood. 



3.2.2 Template fitting 

In order to best allow for the effect of errors in the mea- 
surement it is preferable to perform a fit in terms of flux, 
rather than magnitude, as the error is symmetric in the for- 
mer but not in the latter (for the case where background 
noise dominates - the low signal regime at which the error 
becomes most significant for photometric redshift estima- 
tion), and this asymmetry becomes important as the size of 
the error term rises. A natural way to express the fit to a 
set of observed fluxes f to a set of template fluxes f would 
be 



p(z,S\F,V) =p(z,S\V)e X p 



f f 



(3) 



However we do not have a set of template fluxes f - we only 
have a set of colours, which are equivalent to template flux 
ratios. We can convert the above to an expression involving 
these flux ratios if we arrange the ratios in terms of Jr as 
Aj = fi/fn, an d using f = JrX and marginalising over all 
}r from to 00 giving 



p(z,S\f,V) cx / P (z,S\V)e X p 



f-fa\ 
07 



df R . 



(4) 

It is this equation that is evaluated to obtain our photo- 
metric redshifts (although the integration can in practice 
be reduced to the range where fn is within a few a/ of /r 
since Xr = 1 and the value of the integrand approaches zero 
outside this region). 

The resulting p(z, S\f,V) can be marginalised over the 
SED range S to obtain a redshift estimate (e.g. by selecting 
the maximum likelihood (ML) or minimum error variance 
(MEV) estimate). The standard deviation of the posterior 
probability function can also be used to give an idea of the 
error in redshift resulting from the photometric error (the 
<jf term in equation 2] at low signal to noise). 

In likelihood-based methods, it is common to use a min- 
imum error variance (MEV) estimator (the mean redshift 
from the likelihood function) rather than a maximum like- 
lihood (ML) estimator. This estimator minimises the av- 
erage deviation of photo-z values from the true redshifts 
iWolf et al]l200ff) . We argue here that in low signal-to-noise 
situations it is best to avoid this step and adjust one's anal- 
ysis to work with p(z) rather than any single estimation for 
an object. In the high signal-to-noise situation the resulting 
p(z) can still be used to obtain such estimates if desired. 



3.2.3 Constructing a prior probability distribution 

To make use of the Bayesian approach we need a method of 
calculating a prior probability distribution V which incor- 
porates our knowledge of the expected redshift distribution 
at a given apparent magnitude. 

Our prior probability is precalculated for each combi- 
nation of a range of apparent magnitudes (tailored to fully 
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cover the range expected for galaxies in a survey), for each 
SED and for each redshift. At every (z,S,R) location (of 
177 points in z, 360 in S and 50 in R) the absolute mag- 
nitude M necessary to result in an observation at R given 
a ACDM cosmology (Qm = 0.3, £Ia = 0.7) is calculated, 
using K-corrections derived from the same galaxy templates 
used to determine photo-z colours. The three variables M, z 
and S are used to derive the value of the galaxy luminosity 
function (GLF) and this is converted to a density per unit 
solid angle on the sky and per redshift interval (given the 
log(l + z) spacing of the redshift points), again for a ACDM 
cosmology. Allowing for the fact that R is the corresponding 
magnitude for the model flux jr, the resulting prior func- 
tion V(z\S, R) found from this table can be directly used in 
equation 0] . 

The COMBO- 17 GLFs are calculated at redshift bins 
centred on z = 0.3,0.5,0.7,0.9 and 1.1. Photometric red- 
shifts (utilising all 17 bands) and derived K-corrections are 
used to determine these GLFs. The functions used in the 
prior are calculated for arbitrary redshift by taking pa- 
rameters (M* ,(f>* ,a) fitted by simple functions to the five 
COMBO-17 values, accounting for their respective error 
bounds (except in the case of a which was held constant 
in redshift but not colour type when the luminosity func- 
tions were originally fitted to the COMBO-17 data). These 
functions are a second order polynomial in log (f>* and a lin- 
ear fit in M* , chosen for a low-order but good fit through 
the points. In the case of (f>* the fit to the logarithm was 
chosen to prevent the galaxy number density from taking 
on non-physical negative values. Furthermore the evolution 
of M* is halted at redshifts greater than 1.0. Th i s con - 
straint on the evolution is consistent with ICohenl J2002T) 
who gives log L*(R) = 37.37 ± 0.25 at 0.8 < z < 1.05 and 
log L* (R) = 37.30 ± 0.40 at 1.05 < z < 1.50. Illbert et alJ 
(2004) also find no evidence of Mr evolving beyond 2=1, 
except for some slight brightening in the case when a is 
fixed, with most evolution occurring in bluer bands. 

The galaxies in each type are assumed to be spread 
evenly within the SED range covered by the original colour 
cuts separating each population when converting between 
the individual 360-point SED classification scheme of the 
templates and the summed 3-population classification of the 
GLFs. 

Whilst a prior constructed in this manner is technically 
cosmology-dependent, being founded on a ACDM luminos- 
ity distance, this is the same cosmology originally used to 
determine the GLFs from COMBO-17, and therefore much 
of the dependence on the cosmological model cancels out. 
This leaves only the potential for small differences due to 
fits to the parameter evolution and is not a concern com- 
pared to the uncertainty already present in the GLFs. 



3.3 Comparison of likelihood and Bayesian 
methods 

3.3.1 Statistical tests 

We examine the effectiveness of the photometric redshift 
estimation in three ways. We define sensitivity as the frac- 
tion of galaxies in some spectroscopically-measured redshift 
range that that have photometric redshift correct within 
some tolerance. However, we are also interested in the frac- 
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Figure 3. Minimum error variance (MEV) photometric redshift 
plotted against VVDS spectroscopic redshifts. Grey points repre- 
sent those estimated with no magnitude prior, black points those 
estimated with the application of a magnitude prior. Limited to 
R < 23. 



tion of galaxies at some estimated redshift that are correct 
within a specified tolerance. Following the terminology used 
in clinical trials, we call this statistic the specificity. 

Finally, we have already discussed the possibility that 
estimations may be biased, and we examine this possibil- 
ity by considering the mean expected photometric redshift 
for samples binned by spectroscopic redshift and how the 
posterior probabilities vary with spectroscopic redshift. We 
also compare redshift distributions for samples of galaxies 
in various magnitude ranges in Section 2] 

3.3.2 Comparison at bright magnitudes: MEV estimation 

At sufficiently bright magnitudes, R < 23, we may com- 
pare directly photometric redshift estimates with spectro- 
scopically measured redshifts from VVDS. This comparison 
has the great strength of being based on a known "ground 
truth", but unfortunately does not probe to the faintest 
magnitudes that are of interest in weak lensing studies. 

Fig-CUsliows MEV photometric redshift estimates based 
on U BVRI data where R < 23 for both a standard template 
technique and with a Bayesian prior applied. The standard 
template technique used is an MEV estimate equivalent to 
using an uninformative flat prior so that 



C oc 



/ 

Jo 



exp 



(5) 



where we again marginalise over the model flux fa- 

For this set of brighter objects (60 per cent of the sam- 
ple), redshift estimates are largely unchanged. Some outlier 
objects are have improved redshifts, but equally some are 
scattered further from the true value. The overall standard 



deviation of z p ^ 



for each technique is not substan- 



tially changed, being 27 per cent worse for the non-Bayesian 
case, but this is predominantly due to changes in outlier es- 
timations. 

As signal-to-noise decreases there is a substantial in- 
crease in the advantage gained by applying a prior, as the 
width of the p(z) function decreases (Fig. [IJ. This is more 
substantial at high redshift since under the standard likeli- 
hood approach fainter objects (which dominate at high red- 
shift) will produce a broader C(z), and it is in this region 
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Figure 4. Mean cr z /(l + z) (standard deviation of p(z)) in the 
case of no prior (upper line) and in the case of a magnitude prior 
(lower bold line). Results are binned in redshift intervals of 0.1 
from z=0 to z=1.4 for objects where R < 24.5 



where the Bayesian approach provides the majority of the 
information. This effect is seen more clearly when data are 
binned by R magnitude in Fig.EJwhere we also see the actual 
52/(1 + z) of MEV estimates. The relatively low Sz/(1 + z) 
errors at the faintest magnitudes may be due to the high- 
confidence spectra from VVDS at such faint magnitudes be- 
ing dominated by objects with particularly high-contrast 
spectral features suitable for making unusually good pho- 
tometric redshift estimates, and should not necessarily be 
relied upon as being representative of the errors for faint 
objects in general. 

For these MEV estimates we have removed redshift off- 
sets of 0.028 x (I + 2) and 0.023 x (l+a) for the non-Bayesian 
and Bayesian sets respectively. Such offsets are a result of 
the systematic effects described below and are dependent on 
various aspects of the photometry calibration. They can not 
be relied upon to be field independent and would in practice 
require some spectroscopic calibration to correctly remove. 

In addition, we have eliminated outliers where z/(l + z) 
has been misestimated by more than 3 standard deviations, 
calculated iteratively in unit magnitude bins from 18.5 to 
24.5 (eliminating also two objects beyond 24.5). 51 outliers 
are eliminated in the non-Bayesian approach and 37 in the 
Bayesian approach. The error estimated from the width of 
the p(z) distribution underestimates the true redshift error 
at high signal to noise, where the largest sources of errors are 
systematic effects such as relative calibrations of the pass- 
bands and variations in object spectra that are not repre- 
sented in the templates. At lower signal to noise the range 
of possible colours covers the templates well enough that 
template incompleteness and passband uncertainties are less 
significant and allow for more accurat e estimations of error 
based on the p(z) width alone (see also lWolf et al.l2004l for a 
more detailed discussion of the properties of such estimation 
errors). In the case of the sample used here this transition 
to noise-dominated errors occurs at -R < 23 and is the rea- 
son the R < 23 cut was chosen in the examination of MEV 
estimates earlier. 

Sensitivity and specificity for UBVRI MEV estimates 
with and without a prior is given in Fig. El for the 400 ob- 
jects with R < 23. Outliers are included again, and we use 
a varying tolerance proportional to 1 + z (as a fixed reso- 
lution SX/X translates to fixed Sz/(1 + z)). Overall 73 per 
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Figure 5. RMS redshift error as <5z/(l + z) as a function of 
R magnitude without prior (thin) and with (thick) along with 
width of probability functions scaled by 1/(1 + 2) (dashed lines), 
binned in 0.5 magnitude intervals from R = 18.5 to R = 24.5, 
< z < 1.4. Outliers, defined as objects more than 3s. d. (cal- 
culated iteratively) from the mean Sz/(l + z), are excluded (6 
and 8 per cent for the Bayesian sample and non-Bayesian sample 
respectively) . 

cent of objects have redshifts correct to 0.05 x (1 + z) in 
the non-Bayesian case and 78 per cent in the Bayesian case. 
92 per cent have redshifts correct to 0.1 x (1 + z) in the 
non-Bayesian case, and 94 per cent in the Bayesian case, 
showing that single-valued estimates of redshifts are largely 
unchanged at brighter magnitudes. Error bars are estimates 
based on the assumption that objects fall in or out of the 
tolerance range according to the binomial distribution and 
are shown in order to give a better idea on the constraint 
of the true tolerance and can not indicate whether the tech- 
niques are significantly different. Performance by magnitude 
is also shown in Fig. El again showing broadly similar per- 
formances between the two methods for MEV estimates at 
brighter magnitudes, showing that gains are starting to be 
made as we approach the point at which these estimates will 
start to break down. 

3.3.3 Using the full posterior probability distribution 

At low signal-to-noise it becomes appropriate to avoid single- 
valued estimations (see Section . To compare sensitivity 
and specificity in the case where we aim to use summed nor- 
malised posterior distributions we define each as the percent- 
age of the distribution within the given range, so sensitivity 
is defined as 

E J Pi{z)Wi(z)dz 

Ni { ' 

where i is the set of objects in a given bin of spectroscopic 
redshift, and specificity is defined as 

e,/; b p,(^.(^ 

EJZPi(z)dz U 

where i is the set of all objects, za and zb denote the limits of 
the bin in consideration and we use a window function Wi(z) 
as an additional constraint on the limits of the integrals so 
that 

Wi{z) = 1 where \z - z spcCi \ < T(l + 2 sp ecJ 
= at all other points 
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(a) (b) (c) 

Figure 6. (a) Sensitivity, (b) specificity and (c) performance by magnitude of MEV photometric redshifts for R < 23. Upper black lines 
show tolerance of 0.1 X (1 + z) and lower grey lines show tolerance of 0.05 X (1 + z). In each case the thin line represents results before 
a prior is applied and the thick line those afterwards. Error bars are estimates under the assumption that objects fall in or out of the 
tolerance according to the binomial distribution. In the case of performance by magnitude the concept of sensitivity and specificity do 
not apply as they are a result of binning by either photometric or spectroscopic redshift. 



for a given tolerance T. With these definitions sensitivity 
and specificity are sums over equivalent regions in z spec and 
z to our earlier definitions. 

Furthermore we define a performance binned by mag- 
nitude as 

Ni (8) 

where i is the set of objects in a given magnitude bin. 

Sensitivity, specificity and performance plots in this 
case are given in Fig. Q Sensitivity shows a small gain at 
almost all redshifts, and differences in specificity are also 
generally small. Overall performance is 49 per cent within a 
0.05 x (I + 2) tolerance without a prior and 53 per cent with, 
and 76 per cent within a 0.1 x (1 + z) tolerance without a 
prior and 81 per cent with. 

Fig. |5] show the 25th, 50th and 75th percentiles of 
summed p(z) distributions in the cases without a prior and 
with a Bayesian prior over a range of spectroscopic red- 
shifts and gives a clearer indication of regions where the 
prior has most effect. At low redshifts where objects tend 
to be brighter there is little difference but at higher red- 
shifts and for fainter objects the estimates become much 
broader and generally occupy a range from about z » 0.7 
upwards. The Bayesian estimates help resolve a bimodality 
with much lower redshifts around this region (which is the 
cause of the 25th percentile of the non-Bayesian distribution 
lying at such low redshift) and give a higher proportion of 
the estimation to the higher redshifts, although neither ap- 
proach gives a particularly good match between the median 
distribution redshift and the spectroscopic redshifts. 

As well as reproducing the distribution as accurately as 
possible, we must also examine whether selections from this 
distribution are unbiased representations of the true under- 
lying redshifts. Clearly from examining Fig. |H| there is some 
degree of bias present. To check the severity of the bias, 
we also examine the mean photometric redshift in the two 
MEV cases and the mean of the summed posterior proba- 
bilities and the width of this distribution in the Bayesian 
case also in spectroscopic redshift bins of width 0.1 (Fig.|S|. 
We perform a least-squares fit to determine a bias b where 



1.5 




0.0 0.5 1.0 1.5 

Spectroscopic redshift 

Figure 8. Plots of the 25th, 50th and 75th percentile positions of 
summed posterior probabilities in redshift bins of 0.1 for R < 24.5 
in the case of no prior (thin lines) and a Bayesian prior (thick 
lines). Dashed lines indicate a line of z p hoto = Zspec and a region 
±0.1 X (1 + z) tolerance. 



(1 + 

Zphoto 

) = b(l + z S p C c), weighting by the number of ob- 
jects in each bin and inversely to the variance of MEV es- 
timates or the width of the mean p(z) estimate squared. 
Since the ability to make a reasonably unbiased estimate 
appears to break down above 2 ~ 1 as noise increases and 
the 2 < 0.1 contains too few objects to rely upon we con- 
sider 0.1 < 2 < 1.0 only. Non-Bayesian MEV estimates give 
a fit of b = 0.976 ± 0.003, Bayesian MEV estimates give 
b — 0.977 ± 0.003 and summed posterior probabilities give 
b — 0.978 ± 0.003, indicating that there is no significant 
change in bias between the three approaches in that red- 
shift range. 

As well as the bias in each bin, the spread in redshift 
estimates is also a concern. In Fig. [S] in the Bayesian case, 
half the probability distribution is contained within a region 
of 8z » 0.05 at 0.1 < 2 < 0.2 rising to 0.21 at 0.9 < 2 < 1.0. 
At higher redshifts the width is less stable, ranging from 0.28 
to 0.45 at 2 > 1.0. In comparison, the non-Bayesian case has 
Sz ta 0.04 at 0.1 < 2 < 0.2 rising to 0.35 at 0.9 < 2 < 1.0, 
and from there as high as 1.03 at higher redshifts. 

A uniform distribution of redshifts in these bins would 
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Figure 7. (a) Sensitivity, (b) specificity and (c) performance of photometric redshifts, full posterior definition, R < 24.5. Upper solid 
lines show the percentage of the L(z) or p(z) function within a tolerance of 0.1 X (1 + z) and lower dashed lines show tolerance of 
0.05 X (1 + z). In each case the thin line represents results before a prior is applied and the thick line those afterwards. 
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Figure 9. Mean estimated redshift for R < 24.5 binned by spectroscopic redshift (a) without prior, (b) and (c) with prior. Error bars 
represent the la spread in MEV estimates for (a) and (b) and the mean width of the summed p(z) for (c). Lines indicate z = z spcc where 
there would be no bias. 



have half the distribution in Sz = 0.05, so there is a spread 
as would be expected due to the errors arising from esti- 
mating photometric redshifts, which presents a concern. In 
addition, there are significant outlier peaks in the probabil- 
ity distribution (some of which are picked up in Fig. |H| by 
the 25th percentile of the non-Bayesian distribution) which 
could disrupt an analysis, but the impact these regions have 
is reduced in the Bayesian approach, and this advantage is 
strongest for the faint galaxies where traditional methods 
break down. 



Fig- EJshows this also for MEV estimates, although the 
error bars here are influenced by the outlier regions men- 
tioned above. In particular, the low redshift end shows larger 
error bars than might be expected from Fig. |H] as the sec- 
ond moment of the distribution is more sensitive to these 
outliers than the percentile positions. The MEV estimates 
also show less spread than the full posterior approach due 
to undersampling of the tails, and whilst this might appear 
initially a useful property we show in Section^Jthat this can 
be highly problematic as signal-to-noise deteriorates. We re- 
turn to this issue, and that of bias and higher order issues 
of the distribution in weak lensing analyses in Section |S| 



3.4 Effects on redshift estimates from errors in 
the prior 

There is a concern that the prior may not be the correct 
choice due to both measurement errors in the luminosity 
functions used in its calculation, and also due to system- 
atic effects that might arise from the choice of fitted models 
to the luminosity functions and the treatment of the three 
types of galaxies used in the luminosity function calculation 
(such as the choice of spreading the luminosity function den- 
sity evenly across the SED ranges). 

A full analysis of the effect of measurement errors on the 
prior is computationally infeasible, and similarly attempt- 
ing to make a full assessment of the impact of any effects 
of modelling choices and systematic effects arising from the 
method used to assign the luminosity function across SED 
types would be similarly infeasible. However, some assess- 
ment of the errors that would arise can be made by investi- 
gating a few special cases. 

We have reanalysed the above sections with four further 
priors. Two of these have a faint end slope a of the red 
galaxies adjusted by ±1(7, and two have the z = 1.1 value 
of (f>* of the red galaxies adjusted by Ha. We adjust these 
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parameters in preference to any others as the parameters 
most likely to make large changes to the prior. 

High redshift values are chosen since adjustments to 
the low redshift parameters will have less effect, since most 
objects will be expected at higher redshift except for brighter 
objects which will already have well-constrained estimates 
and which will therefore be insensitive to the form of the 
prior. 

Also, an overall shift in (f>* affects the normalisation of 
the prior only, so we vary the value of this parameter for one 
class of galaxies rather than for all. 

We ignore the variation in M* typically has a lower 
impact on the resulting (j> than the value of <j>* does, and 
also since evolution of this parameter is halted at z — 1.0 
the models are more limited in the effect that variations in 
the z = 1.1 point may have. 

These four priors produce very little difference in the 
overall performance of the redshift estimates, with changes 
on the order of tenths of a percent corresponding to just 
a few objects falling in or out of the 0.05 x (1 + z) and 
0.1 x (1 + z) tolerance bounds. For MEV estimates all priors 
have 78% within 0.05 x (1+z) and 94% within O.lx (1+z) for 
all but the prior with a raised, which performs marginally 
better at 95% (R < 23). With the full priors the figures are 
53% for all priors at 0.05 x (1 + z) tolerances and 81% for 
the original and a — la priors and marginally better with 
82% for the a + la and 4>*(z = 1.1) ± la priors (R < 24.5). 

This lack of significant difference can be partly ascribed 
to the fact that these are all brighter objects, and we exam- 
ine this issue again in section |S| when we use simulations as 
deep as R < 25.5 when this issue can be expected to have a 
greater impact. These results also of course only vary one pa- 
rameter by la, and in reality some noise will apply to every 
parameter, but the small changes from varying these param- 
eters which were chosen for maximum effect would not be 
expected to rise to problematic levels, and future constraints 
on luminosity function parameters can be incorporated to 
reduce the impact of the lack of knowledge we have of the 
form of the true underlying distribution of galaxies. 



4 PHOTOMETRIC REDSHIFTS WITH HIGH 
NOISE PHOTOMETRY 

Perhaps the most significant benefit of a full posterior ap- 
proach is in the case where redshift estimates traditionally 
break down owing to photometric noise and L(z) becoming 
increasingly flat. In this situation one might conventionally 
set aside all objects with such poor photometry and in a 
weak lensing analysis simply estimate a single redshift dis- 
tribution for all such objects. As faint objects are both more 
numerous and are generally at higher redshifts than more 
luminous ones the need to deal with them in an optimal 
manner is of greater concern than attempting to improve 
the already reasonably well-constrained redshifts of brighter 
objects. 

The VVDS sample does not provide enough objects at 
a faint enough magnitude to study this effect so instead we 
consider a sample of 28,597 objects from the CDFS field of 
COMBO-17 with 17 < R < 25.5 and is classed by COMB O- 
17 as a galaxy based on photometry jWolf et al.ll2004l de- 
scribes this in more detail). Some objects have photometry 



that does not fit the templates sufficiently well and have 
extremely low likelihoods that produce a p(z) distribution 
that due to computer rounding is zero at all redshifts, and 
these are excluded. There are not in general spectroscopic 
redshifts available for this sample, and for fainter objects 
reliable photometric redshifts are not available either (even 
when using all 17 bands). However, we can consider the dis- 
tribution of photometric redshifts and compare them to ex- 
pectations. Plots of the distributions of U BV Rl-based MEV 
estimates along with the summed posterior probabilities and 
an expected distribution derived from the prior in three mag- 
nitude ranges are shown in Fig. 1101 along with 17-band MEV 
estimates from COMBO-17 for the brightest set. 

For brighter objects the estimates all converge on a 
result that inclu des more details of the large-scale st ruc- 
ture in the field JGilli et al.ll2003L iLe Fevre etafll2004l and 
IWolf et aljl2004l describe the CDFS as having rather pro- 
nounced large-scale structure). The peaks at z w 0.15 and 
z ~ 0.6 are both seen in the distribution of COMBO-17 
photometric redshifts for example, although they are not so 
well resolved in the broad-band only estimations. The ad- 
justments made by the prior are either small or, in the case 
of outlier correction, too rare to have a large effect on the 
overall distribution. 

In the faint regime of R > 24 the MEV estimates are 
clearly distorted by the inability to estimate a high redshift 
from a very broad L(z) function (since L(z) = at z > 1.4) 
and to a lesser extent the same effect at low redshift, and as 
a result the estimates cluster at the centre of the range of 
allowed estimates giving a distribution that is too sharply 
peaked. In the non-Bayesian case the relative coincidence of 
the peak of the distribution with that of the expected distri- 
bution is not due to any fundamental link between the two 
but that the expected distribution's peak happens to fall 
near the centre of the estimation range. Even in the case of 
Bayesian MEV estimates the p(z) distribution is still broad 
enough that the estimates cluster strongly in the central 
region of the estimation. These failures to make a good esti- 
mate of the distribution make MEV estimations unusable for 
faint objects. Maximum-likelihood (or maximum-posterior- 
probability) estimators would suffer a similar problem (al- 
though in the non-Bayesian case we would not expect the 
distribution to be so sharply peaked, it would still have little, 
if any, connection to the true distribution) . Having a distri- 
bution that is affected by this issue, whether as extremely as 
demonstrated in Fig. 1101 or to a lesser degree, would cause 
a bias in estimates of cosmological parameters from weak 
lensing measurements. 

In contrast, the summed posterior distribution does not 
lose the contributions at the extremes and follows the ex- 
pected distribution but with some modulation from any in- 
formation that can be obtained from the photometry. Most 
importantly the severe bias in the distribution is eliminated 
and whilst there may be some present from the prior itself 
being generally biased compared to the true distribution, 
this could be reduced by constructing new priors as infor- 
mation on luminosity functions improves. We have there- 
fore in the summed posterior probability approach moved 
smoothly from a well-constrained photometric redshift dis- 
tribution towards a broadly estimated distribution providing 
a unified approach without having to decide upon a limit- 
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Figure 10. Estimated n(z) distributions from a non-Bayesian MEV estimation (dashed line), Bayesian MEV (thick dashed line), 
summed posterior probability approach (thin continuous line) and a pure prior based solely on knowledge of the magnitude selection 
(thick continuous line). The distributions arc for (a) 17 < R < 23 (3,390 objects), (b) 23 < R < 24 (5,285 objects) and (c) 24 < R < 25.5 
(18,885 objects). The fluctuations in the prior and summed posterior probability are slight artifacts from the process of calculating the 
prior distribution. For the 17 < R < 23 set COMBO-17 MEV estimates are also included, shown as a solid grey line. 3 per cent of 
the sample do not receive a COMBO-17 estimate, and for the fainter samples this proportion rises considerably, so we do not show the 
distributions in these cases. All three sets of MEV estimates arc binned in rcdshift intervals of 0.02. 



ing accuracy of photometric redshifts beyond which we deal 
separately with any objects with poor photometry. 



5 SAMPLE PRIORS 

An alternative approach to dealing with this problem of bias 
is the "sample prior" described in Section [2.31 - a choice of 
prior that does accurately produce a set of estimates that 
reproduces n(z) accurately, albeit at the cost of individ- 
ual redshift estimates. This sample prior would have to be 
constructed through Monte Carlo simulations. By using the 
Bayesian prior to produce a sample set of simulated objects 
with a suitable overall distribution with colours and mag- 



nitudes based on the photometric redshift templates, and 
scattering these colours by an amount that might be ex- 
pected from typical photometric errors we can construct a 
large simulation of a multicolour survey. These can be fed 
through a standard non-Bayesian redshift estimation algo- 
rithm which will scatter the original distribution to some es- 
timated distribution and a suitable prior can be constructed 
that would convert the resulting estimated distribution back 
to the original. 

This presents a practical problem in that the space to be 
sampled is very large, but since we expect n to vary smoothly 
with both z and R, and to a lesser extent SED we can still 
produce an attempt at a sample prior through fitting poly- 
nomial functions to the distributions to smooth over un- 
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17 < R < 25.5 



z 


MEV 


MEV 


Full 


Prior 




non-Bayesian 


Bayesian 


posterior 


alone 


0.3 


9.96 


11.87 


10.78 


9.56 


0.5 


4.02 


5.04 


4.96 


4.67 


0.7 


2.01 


2.48 


2.71 


3.05 


17 < R < 23 


z 


MEV 


MEV 


Full 


Prior 




non-Bayesian 


Bayesian 


posterior 


alone 


0.3 


7.83 


7.51 


7.79 


7.49 


0.5 


3.39 


3.30 


3.60 


4.21 


0.7 


5.03 


4.57 


4.11 


3.74 



Table 3. Critical surface mass densities in two magnitude 
ranges for varying lens redshifts and distribution estimates, 
lO 8 M /ikpc- 2 



dersampled regions. An attempt at constructing a sample 
prior in this way succeeds in producing a distribution esti- 
mate approximately equivalent to a non-Bayesian approach 
but not as good as the summed posterior approach. Whilst 
this might be improved upon by constructing an improved 
prior the computational difficulty involved would outweigh 
the marginal gains that might be expected over the summed 
posterior estimation. 



6 EFFECT ON LENSING CALCULATIONS 

To illustrate the effect of distortions in redshift distribution 
estimations on weak lensing studies we consider the pre- 
dicted value of the critical surface mass density E cr it for 
lensing, which provides a scale by which the dimensionless 
mass surface density k can be converted to an estimated sur- 
face density. For a particular combination of lens and source 
redshifts, this critical density is given by 



E C rit — 



^GD d 



ft 



where 



(3 = (D s / Dds) 



(9) 



(10) 



with Dd, Dds and D 3 being the angular diameter distances 
between observer and lens, lens and source and observer and 
source respectively. 

The CDFS galaxy sample selected here has a distribu- 
tion of redshifts, and hence for a given lens redshift, the 
value of E cr it is a weighted average over the distribution. 
At faint magnitudes the actual distribution is unknown, but 
we can illustrate the effect of the differences between the 
redshift estimators discussed above by calculating E cr it for 
each of the estimated distributions. We choose as test cases 
three lens redshifts (z = 0.3, 0.5 and 0.7) and use the red- 
shift distributions obtained from: (i) MEV estimates using a 
non-Bayesian approach; (ii) MEV with Bayesian prior; (iii) 
full posterior distributions; (iv) the prior distribution alone 
(based only on the magnitude limits with no colour infor- 
mation) . 

For the CDFS sample with 17 < ii < 25.5 we find values 
of critical density shown in Tabled] At low lens redshifts the 
differences between the densities are not dramatic (as is ex- 
pected as (3 changes most near the lens where the estimates 



all largely agree), but at higher lens redshifts the differences 
can be larger, most notably between the non-Bayesian esti- 
mates and the three other estimates at a lens redshift of 0.7, 
where the non-Bayesian estimate differs from the others by 
between 20% and 50%. In addition, although the Bayesian 
MEV estimates look generally reasonable, the fact that a 
large number of objects that other estimates place in front 
of the lens are classified instead as background objects could 
lead to these "misclassified" objects diluting the shear sig- 
nal and causing the lens mass to be underestimated, despite 
getting an apparently reasonable estimate of E cr it in the 
above calculation (such effects are investigated below with 
a Monte-Carlo simulation). 

The majority of the disagreement in the above test 
arises from the R > 23 objects, as can be seen when the 
critical densities for brighter objects alone (17 < R < 23) 
are considered. In this case, where the colours of the objects 
are the primary source of information rather than the prior, 
the first three distributions all provide very similar results, 
but the prior alone generally disagrees slightly. The excep- 
tion is the z = 0.7 lens, where the background sources are 
all in the tail of the redshift distribution and are close to the 
lens, so that slight changes in the estimation of n(z) have 
a dramatic effect owing to the steep variation in f3 near the 
lens redshift. 

We can use a Monte-Carlo simulation to test the ef- 
fects of poor redshift estimation and the resulting change 
in mass estimates owing to errors in individual values of j3 
and to foreground/background errors. We produce a Monte 
Carlo simulation of COMBO-17 5-band data in which 10,000 
objects are drawn randomly from the template set and as- 
signed an _R-band magnitude according to the prior distri- 
bution. Each object has its UBVRI fluxes calculated from 
the template colours and the assigned ii-band magnitude, 
and each of the five fluxes is scattered assuming a Gaussian 
error typical for a background- limited signal in COMBO-17. 

After estimating redshifts for this simulated dataset, we 
have a set of simulated objects with a 'true' redshift Zi for 
even the faintest objects, along with an estimated photomet- 
ric redshift z csti . We then assign each object a convergence 



(11) 



t(Zi) 



for a fixed surface mass density E, and where foreground 
objects are given a convergence of zero. 

Next, we calculate an estimate of the surface mass den- 
sity that would be inferred from its convergence and esti- 
mated redshift value, 



Ei — E cr it (Zesti )^i 



(12) 



Ds ost . > Dd, and with a final overall estimate of the surface 
mass density £ being determined as 



(EiWi) / (wi) . 



(13) 



The weighting Wi is a foreground/background parameter 
and is 1 in the case of MEV estimates where D s > Dd and 
elsewhere, and in the case of a posterior distribution or a 
prior is given by 



p(z)dz. 



(14) 
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MEV 


MEV 


Full 


Prior 




non-Bayesian 


Bayesian 


posterior 


alone 


0.3 


1.05 


1.02 


1.03 


1.01 


0.5 


1.01 


1.12 


1.01 


0.99 


0.7 


1.17 


1.22 


1.03 


0.99 


0.9 


1.61 


1.23 


1.01 


1.04 



Table 4. Surface mass densities (ratio to true value) for four 
distribution estimates and four lens redshifts from simulated data, 
17 < R < 25.5. 



Resulting mass estimates in each case are given in Ta- 
ble £1] for lenses at z — 0.3, 0.5, 0.7 and 0.9. The prior alone 
performs much better in this case as the 'true' redshift distri- 
bution is drawn directly from it. The unusually good result 
for the non-Bayesian MEV estimates at z = 0.5 may be due 
to various effects of misestimations cancelling out by chance. 
The dramatically poor result for the z — 0.9 lens for these 
estimates suggest that this choice would be a poor one in 
general. Even the Bayesian MEV estimates show significant 
misestimations of the surface mass density and can't there- 
fore be recommended as an approach unless the problematic 
fainter objects are excluded (at R < 23 the Bayesian MEV 
estimated densities are all within 2% of the true value except 
for a lens redshift of 0.9, whereas the non-Bayesian MEV es- 
timated densities deviate by more than 2% for all the lens 
redshifts) . 

When the reconstructed mass densities are dramati- 
cally incorrect they overestimate. This is understandable if 
the numbers at the highest redshift are underestimated, as 
is seen in the MEV estimations at low signal-to-noise in 
Fig. 1101 Galaxies in this region have their redshifts underes- 
timated, and as a result the calculated lensing mass must be 
higher to produce the same lensing effect at a lower distance 
from the lens. 

For the faintest galaxies in a real survey however, the 
shape measurements will also be noisy, and will be down- 
weighted, which will generally reduce the effect of photo- 
metric redshift errors. This reduction in the likely errors 
compared to the Monte Carlo simulations given here will 
not be of as much benefit for those lensing systems at high 
redshift, since the background sources in this case will all 
tend to be fainter and have largely comparable weightings. 

We return briefly to the issue raised in Section 13.41 
where the effect of noise on the measurements of the GLFs 
was considered. Using the same priors as used in that section 
(a varied by ±1<t and <j)*{z = 1.1) for red galaxies varied by 
±1<t) we recalculate the surface mass densities of Table [I] 
The results are given in Table 03 The differences are still 
small (a few tenths of a percent difference in surface mass 
densities between the various priors) and whilst the error on 
the prior may be larger and the inaccuracy in the prior may 
be large enough to have appreciable effects, they are still 
likely to be much smaller than those effects that arise from 
a non-Bayesian or MEV estimation approach. 

We also return to the issue of biases in redshift estimates 
and issues of higher order moments of the posterior proba- 
bility distribution. Section ^, 3. 3l eave estimates of the bias of 
photometric redshifts, expressed as (l+Zphoto) = b(l+z spcc ). 
Fitting to the VVDS sample at 0.1 < z < 1.0 gave b = 0.978, 
which would lead to estimated redshifts generally falling 
slightly below the true redshift, and therefore lead to a slight 
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Unaltered 


a + la 


a — la 


0* + la 


<f>* - 1 


0.3 


1.028 


1.029 


1.028 


1.029 


1.029 


0.5 


1.014 


1.015 


1.014 


1.015 


1.014 


0.7 


1.026 


1.030 


1.028 


1.031 


1.027 


0.9 


1.008 


1.009 


1.011 


1.011 


1.009 



Table 5. Surface mass densities (ratio to true value) for full poste- 
rior distribution estimates and four lens redshifts from simulated 
data, 17 < R < 25.5, using priors calculated from altered GLF 
parameters. 

overestimation in lensing masses. In the case of a single 
source at z = 1, for lens redshifts of 0.3, 0.5 and 0.7 the 
misestimations are 1.2%, 2.8% and 6.7% respectively, much 
lower than those arising from choosing an alternative esti- 
mation technique (although this bias was measured only for 
the restricted range of objects described earlier, and will be 
more of a problem at z > 1). 

Higher order moments of the posterior probability are 
also a concern, since the critical densities we estimate are 
based on (3 — (D s /Dd s ), and (/?) is not proportional to (z). 

However, since /3 approaches a constant as D s increases, 
variations in the distribution at high redshift do not impact 
the estimated lensing mass as much as estimates at low red- 
shift. As a result, for low lens redshifts the shape and width 
of the distributions are not a major concern, since close to 
the lens where /3 is varying most rapidly the sources are also 
at low redshift and, being generally brighter, have tightly 
constrained distributions. At high lens redshifts the distri- 
bution widths and shapes are much more of a concern as 
they will be broader and cover regions of significant vari- 
ation in j3. In these cases estimates of the bias alone may 
prove insufficient and the best approach to estimating re- 
sulting errors would likely be further simulations. 

The conclusion from these tests is that the full poste- 
rior approach is capable of providing lensing estimates that 
agree with previously used photometric techniques in the 
R < 23 regime, and yet also agrees for fainter samples with 
the modelled redshift distributions, providing consistent es- 
timated lens masses over a broad range of lens redshifts. 
In these tests Bayesian MEV estimates perform better than 
pure likelihood estimates, but are likely to be less good in 
3-D lensing analyses. In such analyses, the distribution of 
mass is estimated as a function of redshift using the photo- 
metric redshifts of the observed galaxies. Hence we expect 
the estimated mass distribution will be even more sensitive 
to any systematic poor estimation of redshifts than the tests 
that we have described above. 



7 CONCLUSIONS 

Extraction of the full information content of future gener- 
ations of weak lensing surveys will require extensive use of 
photometric redshifts, applied to both 2D and 3D analyses. 
Weak lensing signals are concentrated towards the magni- 
tude limits of surveys, and yet it is here that photometric 
measurement errors make photometric redshift estimation 
the most unreliable. These errors lead not only to a broad 
distribution of redshift uncertainty, but also to increasing 
severity of the degeneracy between galaxy type and redshift 
that frequently leads to multiple solutions for an estimated 
redshift. Furthermore, the effect of measurement errors is to 
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make the sample distribution of estimated redshifts differ- 
ent from the true distribution of redshifts, which inevitably 
leads to bias in the values of cosmological parameters esti- 
mated from the overall sample. 

In this paper we have considered the Bayesian solution 
to this problem, and have shown that using the full Bayesian 
posterior probability distribution for estimated redshifts al- 
lows such biases to be eliminated, provided a prior distri- 
bution, here based on knowledge of the galaxy luminosity 
function and its evolution, is known. Using photometric and 
redshift data from the COMBO-17 sample, we have com- 
pared the full Bayesian approach with pure likelihood esti- 
mation and minimum error variance (MEV) methods and 
show that, although there is little to choose between meth- 
ods at bright magnitudes, the full Bayesian method is sig- 
nificantly less biased at faint magnitudes. Furthermore, the 
Bayesian method allows the sample distribution of estimated 
redshifts to tend smoothly towards the prior distribution at 
the faintest limits of a survey. 

Applying the method to simulated weak lensing signals, 
we show that reconstruction of lensing mass density in the 
presence of photometric redshift errors can lead to biases 
in excess of 20 percent in 2D lensing analyses unless the 
Bayesian method is adopted. We anticipate that 3D lensing 
analyses will be even more susceptible to this effect, but 
we postpone to a future paper a full incorporation of the 
Bayesian probability approach into 3D lensing analysis. 
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